A Proposition Bank of Urdu
نویسندگان
چکیده
This paper describes our efforts for the development of a Proposition Bank for Urdu, an Indo-Aryan language. Our primary goal is the labeling of syntactic nodes in the existing Urdu dependency Treebank with specific argument labels. In essence, it involves annotation of predicate argument structures of both simple and complex predicates in the Treebank corpus. In this paper, we describe the overall process of building the PropBank of Urdu. We discuss various statistics pertaining to the Urdu PropBank and the issues which the annotators encountered while developing the PropBank. We also discuss how these challenges were addressed to successfully expand the PropBank corpus. While reporting the Inter-annotator agreement between the two annotators, we show that the annotators share similar understanding of the annotation guidelines and of the linguistic phenomena present in the language.
منابع مشابه
Politeness Orientation in Social Hierarchies in Urdu
The present research is aimed at investigating how the politeness of the speakers of Urdu is influenced by their relative social status in society. The researcher took politeness theory of Brown and Levinson (1978, 1987) as a model. To observe politeness of Urdu speakers, speech act of apology with different strategies was selected. A Discourse Completion Task (DCT) was used as an instrument to...
متن کاملThe dilemma of Rationality or Providing Efficiency in Monetary Policy Making: An Application of Arrow’s
Financial frictions inducted in the model is a new contribution to monetary economics. Herein, an analytical tool arranges monetary policymaking in the form of two steps procedure. In the first step, an appropriate amount of money supply should be assessed; and in the second step, that appropriate amount should be allocated to several sectors. The Central Bank obligates the step of assessment a...
متن کاملAutomatic Semantic Role Labeling for Chinese Verbs
Recent years have seen a revived interst in semantic parsing by applying statistical and machinelearning methods to semantically annotated corpora such as the FrameNet and the Proposition Bank. So far much of the research has been focused on English due to the lack of semantically annotated resources in other languages. In this paper, we report first results on semantic role labeling using a pr...
متن کاملImproving Chinese Semantic Role Labeling with English Proposition Bank
Most researches to SRL focus on English. It is still a challenge to improve the SRL performance of other language. In this paper, we introduce a twopass approach to do Chinese SRL with a Recurrent Neural Network (RNN) model. We use English Proposition Bank (EPB) to improve the performance of Chinese SRL. Experimental result shows a significant improvement over the stateof-the-art methods on Chi...
متن کاملDevelopment of Tree-bank Based Probabilistic Grammar for Urdu Language
The process includes in hand tagged corpus, tree annotation on paper for large corpus, NU-FAST Treebank in form of brackets, extraction of CFG through NU-FAST Treebank, evaluation of PCFG from CFG and then PDCG from PCFG for inspection/testing through PROLOG parser.
متن کامل